Model Selection

Instruction Fine-tuning

# Instruction Fine-tuning

Kakaocorp.kanana 1.5 8b Instruct 2505 GGUF

Kanana-1.5-8B-Instruct-2505 is an 8B-parameter instruction fine-tuned language model developed by Kakao Corp, suitable for text generation tasks.

Large Language Model

Marin Community.marin 8b Instruct GGUF

marin-8b-instruct is an 8B-parameter-scale instruction fine-tuned language model suitable for text generation tasks.

Large Language Model

Allenai.olmo 2 0425 1B Instruct GGUF

OLMo-2-0425-1B-Instruct is a 1-billion-parameter instruction-finetuned language model developed by AllenAI, focused on text generation tasks.

Large Language Model

Olmo 2 0425 1B Instruct GGUF

OLMo 2 1B Instruct Edition is a post-training variant of the OLMo-2-0425-1B-RLVR1 model, optimized through supervised fine-tuning, DPO training, and RLVR training to achieve state-of-the-art performance across multiple tasks.

Large Language Model English

Josiefied Qwen3 4B Abliterated V1 Gguf

This is the GGUF quantized version of the Josiefied-Qwen3-4B-abliterated-v1 model, suitable for local deployment and execution.

Large Language Model

Goekdeniz-Guelmez

Olmo 2 0425 1B Instruct

OLMo 2 1B is a post-training variant of the allenai/OLMo-2-0425-1B-RLVR1 model, undergoing supervised fine-tuning, DPO training, and RLVR training, aiming to achieve state-of-the-art performance across multiple tasks.

Large Language Model

Transformers English

Stablelm Zephyr 3b GGUF

StableLM Zephyr 3B is a 3-billion-parameter instruction-tuned model trained on public datasets, synthetic datasets, and Direct Preference Optimization (DPO), delivering excellent performance.

Large Language Model English

Gemma 2 9b It Abliterated GGUF

A quantized version based on Gemma 2.9B, optimized using llama.cpp, suitable for running in LM Studio.

Large Language Model English

Badger Writer Llama 3 8b

Badger Writer is a normalized Fourier task superposition model based on multiple Llama 3 8B models, specializing in text generation tasks, particularly excelling in creative writing and instruction following.

Large Language Model

Gemma 2 Llama Swallow 27b It V0.1

A Japanese-enhanced large language model based on the Gemma-2 architecture, significantly improving Japanese capabilities while retaining original English proficiency

Large Language Model

Transformers Supports Multiple Languages

Gemma 2 Llama Swallow 2b It V0.1

The Gemma-2-Llama-Swallow series is built through continued pre-training of the gemma-2 model, significantly enhancing Japanese language processing capabilities while retaining original English proficiency.

Large Language Model

Transformers Supports Multiple Languages

OLMo 2 1B is the smallest model in the open language model series released by the Allen Institute for Artificial Intelligence, based on OLMo-mix-1124 pre-training and further trained with the Dolmino-mix-1124 dataset during the intermediate training phase.

Large Language Model

Transformers English

Videochat R1 Thinking 7B

VideoChat-R1-thinking_7B is a multimodal model based on Qwen2.5-VL-7B-Instruct, focusing on video-text-to-text tasks.

Transformers English

Multilingual E5 Large Instruct Q8 0 GGUF

Multilingual E5 large instruction model, supporting text embedding and classification tasks in multiple languages with strong cross-language capabilities.

Large Language Model Supports Multiple Languages

R01 Gemma 3 1b It

Gemma 3 is a lightweight open-source multimodal model introduced by Google, built on the same technology as Gemini, supporting text and image inputs to generate text outputs.

Transformers English

This model is a text generation model fine-tuned based on Qwen2.5-14B-Instruct and trained using the TRL library.

Large Language Model

Toastypigeon Gemma 3 Starshine 12B GGUF

A creative writing model based on Gemma 3 12B, excelling in narration and scene construction with a novelistic style

Large Language Model English

Allura Org Gemma 3 Glitter 4B GGUF

GGUF format model file converted from allura-org/Gemma-3-Glitter-4B, optimized with imatrix quantization

Large Language Model English

Doge 320M Instruct

Doge 320M Instruct is a lightweight language model based on dynamic masked attention, trained with supervised fine-tuning (SFT) and direct preference optimization (DPO), suitable for question-answering and dialogue tasks.

Large Language Model

Transformers English

Thedrummer Fallen Gemma3 4B V1 GGUF

This is a quantized version of TheDrummer/Fallen-Gemma3-4B-v1 model, processed using llama.cpp, suitable for text generation tasks.

Large Language Model

Mistral Small 3.1 24b Instruct 2503 Hf

Mistral Small 3.1 Instruct 24B is a large language model based on instruction fine-tuning, focusing on text generation tasks.

Large Language Model

Qwen2.5 Bakeneko 32b Instruct V2

An instruction-tuned variant based on Qwen2.5 Bakeneko 32B, enhanced with Chat Vector and ORPO optimization for improved instruction-following capabilities, excelling in Japanese MT-Bench.

Large Language Model

Transformers Japanese

Teacher Persona GGUF

Qwen2-1.5B-Instruct is a 1.5 billion parameter instruction fine-tuned large language model released by Alibaba Cloud, suitable for Q&A and dialogue tasks.

Large Language Model

T3Q Qwen2.5 14b V1.2 E2

T3Q-qwen2.5-14b-v1.2-e2 is a post-trained version based on the Qwen/Qwen2.5-14B-Instruct-1M model, using LoRA-8-4-0.0001-cosine-32-16 configuration and trained on train_data_v1.2.

Large Language Model

Transformers Supports Multiple Languages

T3Q Qwen2.5 14b V1.0 E3 Q4 K M GGUF

This is a quantized model based on Qwen2.5-14B-Instruct-1M, converted to GGUF format, suitable for the llama.cpp framework.

Large Language Model Supports Multiple Languages

Gemma 3 12b Novision

A text-only version converted from google/gemma-3-12b-it, with visual components removed, focusing on text generation tasks

Large Language Model

Google.gemma 3 4b It GGUF

Gemma 3.4B IT is a 3.4 billion parameter large language model developed by Google, focusing on the instruction-tuned version, suitable for various natural language processing tasks.

Large Language Model

TraceBack 12b is a 4bit quantized version based on the Mistral-Nemo-Instruct architecture, focusing on instruction-following and chain-of-thought reasoning tasks.

Large Language Model

Llama 3.1 8b Medusa V1.01

An 8B-parameter language model based on the Llama 3.1 architecture, created by merging multiple specialized models, excelling in text generation tasks.

Large Language Model

Kanana Nano 2.1b Instruct

Kanana is a bilingual (Korean/English) language model series developed by Kakao. This 2.1B parameter version outperforms similar models in Korean while maintaining efficient computational costs.

Large Language Model

Transformers Supports Multiple Languages

Hiber Multi 10B Instruct

Hiber-Multi-10B-Instruct is an advanced multilingual large language model based on Transformer architecture, supporting multiple languages with 10 billion parameters, suitable for text generation tasks.

Large Language Model

Transformers Supports Multiple Languages

Huihui Ai.qwen2.5 14B Instruct 1M Abliterated GGUF

A 14B-parameter large language model focused on instruction-following tasks, supporting text generation capabilities.

Large Language Model

Nousresearch DeepHermes 3 Llama 3 8B Preview GGUF

A dialogue model fine-tuned based on Llama-3-8B, supporting multiple quantization versions, suitable for tasks such as chatting, reasoning, and role-playing.

Large Language Model English

Guardreasoner 1B

GuardReasoner 1B is a version fine-tuned via R-SFT and HS-DPO based on meta-llama/Llama-3.2-1B, focusing on classification tasks for analyzing human-AI interactions.

Large Language Model

Transformers English

Guardreasoner 8B

GuardReasoner 8B is a fine-tuned model based on meta-llama/Llama-3.1-8B, specializing in reasoning-based LLM safety protection

Large Language Model

Mistral Small 24B Instruct 2501 GGUF

GGUF quantized version of Mistral-Small-24B-Instruct-2501, suitable for local deployment and text generation tasks.

Large Language Model

Deepseer R1 Vision Distill Qwen 1.5B Google Vit Base Patch16 224

DeepSeer is a vision-language model developed based on the DeepSeek-R1 model, supporting chain-of-thought reasoning and trained through dialogue templates for visual models.

mehmetkeremturkcan

Lake 1 Advanced

Mistral-7B-Instruct-v0.3 is a large language model fine-tuned for instruction following based on Mistral-7B-v0.3, supporting function calls and extended vocabulary.

Large Language Model

A vision-language model based on Microsoft's Phi-1.5 architecture, combined with CLIP for image processing capabilities

Transformers Supports Multiple Languages

A multimodal large language model developed based on the paper 'Task Preference Optimization: Improving Multimodal Large Language Models through Visual Task Alignment'

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase